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ABSTRACT 

Regression lines for the prediction of Wide Range 
Achievement Test (HRAT) scores by Hechsler Intelligence Scale for 
Children (HISC) and HISC Revised (HISC-R), intelligence quotients were 
compared across race, by the Potoff technique. Subjects were 36 black 
and 30 white children who were referred for psychological evaluation 
because of learning or behavior problems. Regression lines for blacks 
and white did not differ significantly for the prediction of WRAT 
scores by either the HISC or WISC-R. Results lend general support for 
the use of a common regression line in the prediction of achievement 
scores for the two races* (Author/GDC) 
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Abstract 

Regression lines for the prediction of WRAT scores by 
WISC end WISC-R FSIQ's were compared across race, by the 
Pot thoff technique, for 36 black and 30 white referred x 
children. Regression lines for blacks and whites did not 
differ significantly for the prediction of WRAT scores 
by either the WISC or WISC-R. Results lend general sup- 
port for the use of a common regression line in the pre-^ 
diction of achievement scores for the two races. 
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the use of psychological tests normed' primarily 
with white children for psychological diagnosis and ed- 
ucational dec is ion -making concerning minority children, 
has come under increasing scrutiny in recent years. Al- 
though much discussion of the issue has appeared in boc,h 
the scientific and public literature, few data of rele- 
vance to the -issue (with school age children) have been > 
presented. The use of such tests are of special concern 
to the school psychologist, particularly in view^of the 
Larry P. case and PL 94-142. Harrington (1976, 1975) has 
gone so far as to state that it is not possible for tests 
developed an^l normed on a white majority to be other than 
biased against minorities and to show less predictive •* 
validity when used with minorities. ; 

In response to pressure from the Black Psycholo- 
gical Association (which was actually requesting a mora- 
torium on the use W psychological and educational. tests 
with disadvantaged students) , the APA Board of Directors 
requested, in; 1968, the Board of Scientific Affairs to 
appoint a group to study ^he use of such tests with dis- 
advantaged students. In reporting on this issue, the 
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cotmnittee (Cleary, Humphreys, Kendrick, and Wesman, 1975) 
of fared a^definition of test bias. While including con- 
tent and construct validity as important variables in the 

7 \ 
issue of test bias , the focus was clearly on predictive 

i 

validity: 

A teat ia considered fair for a particu- 
lar^use^if the inference drawn from the test I 
~~ >( score is made with the smallest feasible ran- I 
dom error and if there is no constant error * 
in the inference as a function of membership 
in a particular group. Q?leary et al., 1974, . v _^^ 
P..25J • . "N 

The definition of bias offered by the APA committee is a 
^restatement of previous definitions by Cardall end 
Coffman (1964), Clearyyd968) T and Potthoff (1966), an<f 
has been widely accepted (though certainly not without^ 
criticism, e.g. , Bernal, 1975; Linn & Werts, 1971; Thorn- 
&dike, 1971). Oakland and Matuszek^ (1977) examined class 
placement procedures under several proposed models of bias 
and demonstrated that the Cleary model results in the 
smallest number of children being misplaced, although un- 
der certain legislative conditions, Oakland and Matuszek 
favored the Thorndike (1971) selection model. A statis- 
tical technique provided by Potthoff (1966). has also re- 
ceived widespread acceptance in the examination of regresr 
sion lines to test bias under the Cleary et al. definition. 

V 
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While considerable data are available on the va- y 
lidity of the SAT for blacks and whites, few studies 
have appeared dealing with differential validity of 1Q 
tests. Mitchell (1967) studied the validity of two broad 
based readiness tests to predict first grade achievement 
for blacks and whitgs finding similar validity coeffi- 
cients for the two "races. Mitchell's study was limited 
to comparing the magnitude of independent-dependent vari- 
able correlation and did not look for identity of regres- 
sion lines. Hartlage. LuCas, and Godwin (1976) compare^ 
the predictive validity of the WTSC and Raven with a group 
of low SES, disadvantaged children. When comparing what 
they considered to be the relatively culture-fair test, 
the Raven Matrices, with the "culture-loaded" 1949 WISC, 
Hartlage et al. (1976) found the WISC to have consis- 
tently larger correlations with measures of reading, ^ 
spelling, and arithmetic than the Raven.' These authors 
only compared the strength of the relationship in each 
case and did not look for identity of regression "lines 
(equivalent beta coefficients and intercept constants). 

Jensen (1976) and Jensen and Figueroa (1975) in- 
vestigated the cons^uct and content validity of several 
popular psychometric instruments across race, including 
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the WISC-R. Jensen (1976; and Figueroa, 1975) concluded, 
finding no differences Co support claims of bias, the 
test 8 evaluated were equally valid for use witn blacks, 
whites, and chlcanos. While Jensen's researcn clearly 
supports the equivalency of Internal psychotfi^tric pro-* 
pertles of e%ch Instrument Investigated across race,, 
Jensen neglecj^ed to Investigate the predictive validity 




of the various measures for blacks, whites, and chlcanos, 
causing his conclusions to be at best premature. The 
major purpose of the present study is to begin providing 
data which will aid in the empirical evaluation of test - 
bias under the Cleary et al, (1975) definition for one 
of the mostly widely used assessment - instruments , the 
WISC-R, and its predecessor, the 1949*WISC. 

Method 

Subj ects ^ 

Thirty white and 36 black school age^ children re- * 
f erred for psychological evaluation for a wide -range of 
learning/behavior disorders comprised the sample.' Sample 
characteristics are described in detail in table IT 
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Children were selected ^or/ inclusion randomly, by a certl- 
fied^scl^ool psychologist, from referrals received with- 
in a public school system. The psychologist ^electing 
subjects and conducting the evaluations had no knowledge •' 
of the purpose of the study. 

Procedure 

All children were tested on the WI$C and.WISC-R 
in a counterbalanced design; the time between testings; 
ranging from six weeks to several mq^tl^s. The. Wide 
Range Achievement Test (WRAT) /Reading and Arithmetic 
subtests were administered at. the time of the second 
testing. Pearson product -moment correlations between the 
WISC and WISC-R Full Scale IQ's and WRAT Reading and 
Arithmetic subtest^ were computed for the total sample 
and separately by race. Two methods of examining differ- 
ential validity (or bias) were employed^, 

The regression lines for each pair of scores weire 
examined by race through the Potthoff (1966) technique. 
The Potthoff technique Is widely accepted and provides a 
simultaneous test of regression coefficients and inter- 
cept values to determine., whether two regression lines are 
equivalent (in this case the b]Lack regression line and the 
white regression line) . The magnitude of the correlations 
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between e*oh set oi independeht -dependent variables for 
the twp racvs was examined by testing' the difference be-t 
tWeen the correlations for the two groups. The method 
chosen is based on Fisher Z> transformations and ,the 
standard error of the correlation. The secondTset ,of ■ 
analyses are related to the comparison of regression co- 
efficients encountered in the Potthoff analysis and their 
computation is somewhat redundant. .The second set of • 
analyses is similar* to methods used by Mitchell (1967) 
and Hartlage et al. (1976) and were computed to deter- 
mine whether different conclusions would have been ob- 
tained had only the strength of the relationship between 
independent -dependent variable pairs been examined across 
race. Due to the sensitivity of the Potthoff procedure 
and* the number (10) of comparisons being made, a con- 
servative p of .01 was adopted. 

Results and Discussion 

Regression lines for blacks and whites did not 
differ significantly for the prediction of WRAT Reading 

-and Arithmetic scores by either the WISC or WISOR. Re- 
gression lines for predicting WISC-R IQ^s from WXSC IQ's 

■'for blacks and whites also showed a high degree of 
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similarity* Table 2 presents a summary of the result* of 
che F-tests for homogeneity of regression derived from 
application of Fotthoff % s (1966) formulation. The pre- 
sent results are generally consistent with previous 

Insert table 2 about here 



investigation^ of emplbyment tests (Bartlett & 0 'Leery., 
1969; Einhorm & Bass, 197*) and the prediction of college- 
success with the SAT (Cleary, 1968; Kallingal, 1971; 

I 

Pfeifer & Sedlacek, 1971). 

When only the magnitude of independent-dependent 

variable relationships were examined, as in some previous 

studies (Hartlage et al..l976; Mitchell, 1967) , 4 no sig- 

> 

nificant difference occurred between the correlations of 
the WRAT subtests with the WISC and WISC-R Full Scale 
IQ's for black- and white children. Correlations between 
the two WRAT subtests and beftween the WISC and WISC-R 
were also highly similar. : These data are summarized in 
table' 2. i - 

Although nc)t dealing with the social issues and 
consequences surrounding intelligence testing , the pre- 
sent results do provide support for the use of a cotnmon 
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regression line to predict achievement scores for black 
end white children with the popular WISC and WISC-R. 
Since homogeneity of regression is commonly assumed in 
the actual practice of clinical and educational assess- 
ment*, data such as that presented herein should become 
increasingly important . Prior to drawing firm conclu- 
sions regarding the differential predictive validity of 
the WISC or WISC-R across race, much lhore research la 
needed utilizing a wide variety of criteria, including 
other individual achievement tests, group achievement 
measures, teacher -made tests, and teacher ratings of . 
achievement. Perhaps similar studies utilizing sex as 
the major variable will also be useful. 
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Table 1 

Sample Characteristics by Race 











X 


SD 


X 


IQ 


X 


IQ 




N 


F 


M 


Age 


Age 


WISC-IQ 


SD 


W1SC-R 


SD 


Blacks 


36 


19 


17 


12.6 


1.3 


84.41 


12.11 


79.75 


10.68 


Whites 


30 


13 


17 


12.9 


1.8 


83.63 


19.37 


79.00 


19.87 


Total 


66 


32 


34 


12.7 


1.5 


84.06 


15.70 


79.41 


1,5.42 



1 
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Table 2 

► 

Summary tabulation of F-tests for similarity of black 
and white regression lines and comparisons of correla- 
tional values in the prediction of achievement. 



of Regression Lines 


F (2,62) 


P 




WISC FSIQ - WRAT Reading 


1.51 


N. 


S. 


WTSC-R FSIQ - WRAT Reading 


2.17 


N. 


S. 


WISC FSIQ - WRAT Arithmetic 


2.75 


N. 


s. 


WISC-R FSIQ - WRAT Arithmetic 


4.05 


N. 


s. 


WISC FSIQ - WISC-R FSIQ 


1.11 


N. 


s. 



Tests 


for Significance of 


Difference Between 


Correlations 






Jj> 




Z 


P 


WISC FSIQ 


- WRAT Reading 


.62 


.71 


0.44 


N.S. 


WISC-R FSIQ 


- WRAT Reading 


.40 


.77 


1.71 


N.S. 


WISC FSIQ 


- WRAT Arithmetic 


.64 


.63 


0.05 


N.S. 


WISC-R FSIQ 


- WRAT Arithmetic 


.69 


.72 


0.15 


N.S. 


1 

WISC FSIQ 


- WISC-R FSIQ 


.81 


.95 


1.02 


N.S . 
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